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Abstract 

Wireless sensor networks are often modeled in terms of a dense deployment of smart sensor nodes 
in a two-dimensional region. Given a node deployment, the critical geometric graph (CGG) over these 
locations (i.e., the connected geometric graph (GG) with the smallest radius) is a useful structure since 
it provides the most accurate proportionality between hop-count and Euclidean distance. Hence, it can be 
used for GPS-free node localisation as well as minimum distance packet forwarding. It is also known to 
be asymptotically optimal for network transport capacity and power efficiency. In this context, we propose 
DISCRIT, a distributed and asynchronous algorithm for obtaining an approximation of the CGG on the 
node locations. The algorithm does not require the knowledge of node locations or internode distances, nor 
does it require pair-wise RSSI (Received Signal Strength Indication) measurements to be made. Instead, the 
algorithm makes use of successful Hello receipt counts (obtained during a Hello-protocol-based neighbour 
discovery process) as edge weights, along with a simple distributed min-max computation algorithm. 

In this paper, we first provide the theory for justifying the use of the above edge weights. Then we 
provide extensive simulation results to demonstrate the efficacy of DISCRIT in obtaining an approximation 
of the CGG. Finally, we show how the CGG obtained from DISCRIT performs when used in certain network 
self-organisation algorithms. 

I. Introduction 

In a wireless sensor network, the smart sensor nodes (often called motes) are embedded in some space, 
region or structure in order to make measurements and to draw inferences. In this paper, we are concerned 
with situations in which there is a dense deployment of nodes over a 2-dimensional region. We propose 
and study an algorithm, called DISCRIT, for the distributed construction of an approximation to the critical 
geometric graph (CGG) over the set of node locations. 

Notation: We denote the region of deployment by A G R 2 . Given a set of n nodes N = {1, 2, . . . , n} and 
the node location vector V = (V\, V2, ■ ■ ■ , V n ) G A n , the geometric graph (GG) of radius r, denoted by 
Q(V,r), is obtained by joining any two nodes within a distance of r. That is, the edge set E of G(V, r) is 
given by E = G -/V 2 : di j < r}, where dij represents the Euclidean distance between nodes i and 

j. A sufficiently small r can cause the resulting GG to be disconnected. Thus, the smallest r on the given 
node locations at which the corresponding GG becomes connected is called the critical radius, denoted as 



r cr it(V). We will call the corresponding GG, £7(V, r cr it(V)), the critical geometric graph (CGG) on V. 
When the node locations are implicit, we denote the critical radius as r cr j t and the CGG as Gcrit- 

Motivation for Gcrit- The following are some reasons why it would be useful to develop a distributed 
algorithm for obtaining the CGG on a given set of node locations. 

1) In most applications of wireless sensor networks, it is important for the nodes to be aware of their own 
locations and distances to other nodes (in particular the network's fusion centers, or base stations). 
While accurate and low-power Global Positioning System (GPS) support is becoming available on 
motes, such an approach may not only be expensive and power hungry, but also, not workable for 
all applications, for example in large indoor areas or in mines. One approach for GPS-free distance 
estimation and node localisation is to overlay a geometric graph on the node locations. It can then 
be shown that the number of edges along the shortest path between an anchor point on the plane 
and a node is roughly proportional to the Euclidean distance between the point and the node, the 
proportionality factor being the radius of the GG (see UJ). The usual approach for obtaining a GG on 
the set of node locations (e.g., and Q) yields a radius equal to the radio-range of the nodes. Since 
the CGG is the connected geometric graph (on the given node locations) with the smallest radius, it 
provides the "finest scale" with which to measure distances using hop-counts on a GG on the nodes. 

2) Further, we observe that, since hop-counts on the CGG provide a measure of Euclidean distance, 
these hop-counts can also be used in topology-free routing. For example, a node that has a smaller 
hop-count (on the CGG) to the base-station is also likely to be closer to the base-station, and hence 
can serve as a forwarding node in a topology-free routing algorithm (see H). 

3) We also recall that, under the setup used by Gupta and Kumar Q, Q cr it turns out to be an asymptotically 
optimal topology for maximising the transport capacity of the network, and minimizing the network 
power consumption. 

Remark: Note that while Point 3) above suggests the use of Q cr it itself as the communication topology, in 
Points 1) and 2) Q cr it is used only as a means for obtaining distance measurements on the region on which 
the nodes are deployed. 

System assumptions: To obtain a distributed algorithm to approximate Gcrit, we make the following 
assumptions. We consider a dense node deployment, where the nodes are deployed in excess of the minimum 
requirement for connectivity and sensing coverage. For the purpose of developing the theory, we consider the 
uniform i. i. d. deployment where each node is placed randomly uniformly on the region A, independent of 
the placement of the other nodes. However, simulation results are shown for the randomised lattice and grid 
deployments as well. In a randomised lattice deployment, the region A is divided into n partitions of equal 
area called cells, and one node is placed randomly over each cell. The grid is a deterministic deployment 
where the nodes are placed in -y/n rows and y/n columns on A, with the rows and columns equally spaced. 

The channel model includes path loss, and fading with additive Gaussian noise, where the fading process 
is assumed to be stationary in space and time with a common marginal distribution. The terrain is assumed 
to be flat, and the node transmission is assumed to be omnidirectional, so that the power radiated in all 
directions is equal. 

Contributions: With the above assumptions, our contributions are the following. We develop DISCRIT 
(Distributed CRITical geometric graph algorithm), a distributed algorithm for constructing an approximation 
to Qcrit- The algorithm is based on a result due to Penrose which holds for uniform i. i. d. deployments, 
and states that as the number of nodes n — > oo the CGG becomes the same as the farthest nearest neighbour 
geometric graph (FNNGG). Given the internode distances, the FNNGG can easily be constructed by a 



distributed max-min computation, thus providing an approximation to the CGG, for large n. Since we do 
not know internode distances, we utilise a technique that provides us with a monotone decreasing function 
of the internode distances, thus permitting the use of a distributed min-max computation to obtain the 
FNNGG. Such a function is obtained by using Hello reception counts obtained during the Hello-protocol- 
based neighbour discovery (see (7|). The Hello transmissions can proceed completely asynchronously (e.g., 
via CSMA broadcasts), thus not requiring transmission synchronisation, as might be necessary in an RSSI- 
based approach. We show theoretically (using our assumptions, above) that the counts we obtain can serve 
as surrogates for the internode distances directly in the special case of isotropic antenna radiation patterns. 
Then we provide extensive simulations results to support this theory and our overall DISCRIT proposal. 

Finally, we provide numerical evaluations of two applications of the approximate CGG obtained from 
DISCRIT: 

1) Optimal forwarding hop distance determination, as per the theory provided by Ramaiyan et al. in (HJ 

2) Hop Count Ration based Localisation (HCRL) (O). 

Related Literature: Narayanaswamy et al. iflOl provide the COMPOW protocol for obtaining Q c nt- The 
idea here is to operate all nodes at the lowest common power level of available discrete power levels 
while ensuring connectivity in the network. As the communication range is an increasing function of the 
transmission power, the minimum common power results in the minimum range for connectivity, and thus 
yields Q cr it- However, COMPOW requires distance-vector routing to be done for each available discrete 
power level; switching between power levels requires synchronisation among the nodes. Unlike COMPOW, 
the proposed algorithm DISCRIT does not require multiple power levels and synchronisation between nodes. 
Also, DISCRIT needs to be run only once by the nodes to obtain Q cr it, unlike COMPOW where routing 
has to be done for each power level. The literature related to using hop-distance as a measure of inter-node 
distance is discussed later in Section [W] 

Outline of the Paper: Section [II] gives the algorithm DISCRIT for obtaining an approximation to Gcrit along 
with the associated theory (III-DI) . Section [HI] provides simulation results on the performance of DISCRIT for 
various deployments. The CGG based distance discretisation technique and its justification are provided in 
Section |lVl Finally, as an application of distance discretisation using DISCRIT, we provide numerical results 
for (i) a self-organisation formulation (Section [V), and (ii) an approach for approximate node localisation 
(Section [VTT >. Section IVIII concludes the paper with future work. 

II. Theory and the Algorithm 

In this section, we first arrive at an algorithm for Q cr u by making use of a result by Penrose 0. The 
algorithm requires each node to know the distances to its neighbours. The distance-free distributed algorithm 
(DISCRIT) is then obtained by using link-weights obtained from a Hello-protocol-based neighbour discovery 
as distance-like information. 

A. Degree-1 GG and Penrose's Theorem 

Given node locations V, let ri(V) be the smallest r such that the corresponding GG, Q(V, r) has no isolated 
node, i.e., Q(V, r) has the degree^ of at least 1. It can be seen that ri(V) is precisely the maximum of the 
nearest node distances, i.e., 



'The degree of a node in a graph is the number of its adjacent nodes. The degree of a graph is the minimum of its node degrees. 



n (V) = max < min joL > (1) 

We call the corresponding GG, C/(V, ri(V)) as the degree-1 GG, and denote it by Q\ and ri(V) by ri 
when the node locations are implicit. 

Consider a uniform i. i. d. deployment of n nodes. Thus the random node location vector V corresponds to 
the probability space (A n ,T n ,V n ) where V is the uniform measure on A, V n is the corresponding product 
measure, and T n is the Borel field in A n . Theorem Q] gives the relationship between Q\ and Gcrit for uniform 
i. i. d. deployment. 

Theorem 1 (Penrose H^j): Let /Ofc(V) be the minimum r at which Q(V, r) is k-connectedQ and 0fc(V) be 
the minimum r at which Q(V,r) has degree k. Then 

lim V n {\ : p fc (V) = a k (V)} = 1 



Since pi(V) = r cr u(V) and 01 (V) = n(V), we have 

Coro/Za^ i: lim^oo P"{V : r crit {V) = n(V)} = 1 ■ 

Thus, Corollary Q] indicates that, for a dense node deployment, Q\ is identical to Q c Ht w.h.p. [| Note that in 
general, a graph with no isolated nodes need not be connected. But the result above implies that, if the graph 
is a GG and the node deployment is uniform i. i. d. then just having no isolated nodes ensures connectivity 
w.h.p. We thus look for a distributed construction of Q\, as it is the same as G C Ht w.h.p. 

Now, suppose each node % knows the distances dij to each of its neighbours j. Then a node would know 
its nearest-neighbour distance too. As r\ is the maximum nearest-neighbour distance from (fl}, a distributed 
maximum-finding algorithm can be run by each node to obtain r\. One such distributed maximum-finding 
algorithm is described in Section III-BI Once n is known, each node includes all nodes within a distance 
of r\ as its adjacent nodes. This results in a GG of radius r\, which is Q\. 



B. An Algorithm for Q\ using Distance Information 

Here, every node i maintains a range r(i) and an adjacent node list N(i) which get updated as the algorithm 
progresses. At any iteration, N(i) is the set of nodes whose distances are less than range r(i) from node i. 

1) Initialisation: For every node, the range is initialised to its nearest neighbour distance, and the adjacent 
node list contains only the nearest neighbour(s) and itself. That is, for all i G N, 

r (0) (i)= mm {dij} and iV (0) (i) = {j e N : d { ,• < r (0) (i)} 

Set the iteration index k = 

2) Current range unicast: Every node i informs its current range r( k \i) to all its current adjacent 
nodes, i.e., the nodes in N^ k \i). Therefore, the node i also receives values of current ranges from 
some of the nodes which belong to the set S^ k '(i) = {j € N : i G iV^fj)}. 

3) Updating the Adjacent Node List: Every node i then updates its current range A k+1 \i) to the 
maximum of the ranges it received. The maximum finding includes the node's current range A k \i) 



2 A graph is fc-connected if there exist k independent paths between any two nodes. 
3 w.h.p. stands for "with high probability," i.e., "with probability — > 1 as n — > 00". 



also. The adjacent node list N^ k+1 \i) is also updated accordingly as the set of nodes whose distances 
from i are within A k+1 \i). 

A k+1 \i) = max{r^(i) : j G S ik \i)} and N {k+1 \i) = {j G N : d l} < r {k+1 \i)} 

4) Terminating condition: The algorithm terminates if all the ranges in an iteration remain unchanged, 
i.e., 

IF r( fe+1 )(i) = r {k \i) for all i G N, Call r(i) = r {k \i) and N(i) = N^ k \i); STOP 
ELSE Set k := k + 1; go to Step 2. 

Note that this is a centralised terminating condition. A distributed terminating condition is discussed 
later. 

5) Obtaining the topology: The graph resulting from the algorithm Ga is obtained by joining each node 
i to each node in its final adjacent node list N(i). i.e, Ga = (V, Ea) where Ea = '■ i G N, j G 

The convergence of the algorithm output to Gi is given in the following theorem. 

Theorem 2: If Gi is connected, then the algorithm converges to Gi, i.e., Qa = Gi, in at most D iterations, 
where D is the hop diameter of Q\. 

Proof: See Appendix |A] for the proof. ■ 

From Corollary [T] Gi is indeed connected with high probability, i.e., Gi = Gcrit w.h.p. Thus from Theorem 
|2j the algorithm above converges to Gcrit w.h.p. 

Remarks 2.1: One can construct deployments where the algorithm fails to give a connected graph. However, 
these cases happen only when Q\ is not connected (for if Q\ is connected then Theorem [2] holds); the 
probability of this, as discussed above, is very small for dense networks (probability — > as n — > oo). 



Remarks 2.2: Gupta and Kumar iTTTTl have shown that r CT a scales as r(n) = Q(y w.p.l as n — > oo. 
Hence the hop diameter of Q cr it scales as @(^y) = ©(yis^) w.h.p. Thus, the algorithm converges in 

iterations w - h -P- 

Remarks 2.3: Note that after updating its range r^(i) to r( fc+1 )(i), the node i needs to communicate its 
new range (i.e., execute Step 2) only if it is different from the earlier range. In such a case, there will be no 
communication when the terminating condition in Step 4 is met. Hence, a distributed terminating condition 
would be that, after informing its current range, each node waits for a certain time-out period, and locally 
decides to terminate the algorithm if it receives no communication from any node during this period. 

1) Extension to Monotone Functions of Distances: Theorem [2] can be shown to hold even in the case where 
the distances dij in the algorithm are replaced by f(dij), where / is monotone increasing. Thus, if we have 
a distance-like information (a monotone function of distance) known at each node about all its neighbours, 
then a distance-free distributed algorithm for Gcrit can be obtained. For this purpose, we consider the Hello- 
protocol-based neighbour discovery proposed by Karnik and Kumar Q, and use certain link weights obtained 
from the neighbour discovery as surrogates for distances. We assume that the antennas of all the motes have 
isotropic radiation patterns. Such radiation isotropy is a valid assumption for external "stick" antennas. The 
resulting distance-free algorithm, DISCRIT, is described in Section lTl-DI The Hello-protocol-based neighbour 
discovery is described next, along with a discussion of monotonicity of link weights under these conditions. 
Under antenna pattern anisotropy (e.g., on-board "patch" or "integrated circuit" antennas), however, these 
link weights are found not to be monotonic with distance. In related work (not reported here) we have 



pursued an approach to collate the link weights from neighbouring nodes in order to obtain a monotonic 
function of distance. 



C. Hello-Protocol-Based Neighbour Discovery 

For the present discussion, we consider a slotted system. Note that this is not a necessary requirement. 
Indeed, the algorithm developed here easily applies to CSMA/CA scheduling. We have implemented the 
algorithm on a Qualnet simulator where IEEE 802.11b CSMA/CA is used, and the results reported later 
in Section |777] are with the CSMA/CA. In each slot, a node chooses to be either in the transmit mode with 
probability a, or in receive mode with probability 1 — a. Whenever in the transmit state, the node broadcasts 
a Hello packet where the Hello packet could simply be a packet containing the node id. Let Hi j represent the 
fading coefficient from i to j, with cumulative distribution A(.), which is assumed to be identical across all 
transmit-receive pairs (i, j). To model if a Hello packet of a transmit node i has been successfully decoded 
by a receive node j, the physical model for communication can be used. That is, transmission from i to j 
in a slot is successful if the signal-to-interference-plus-noise ratio (SINR) of i at j is above a threshold /?, 
i.e., if 

SINR(i,j) = ' J u > P 

where P t is the node transmit power at a reference distance, 77 is the path loss exponent, and a 2 is the noise 
variance. 1^ is the indicator function of the event that the node k is simultaneously transmitting in the same 
slot, given that i is transmitting in the slot. 

This process of broadcasting Hello packets is carried out for a large number of slots t. Let Cij(t) be the 

count of Helios of i received at j. Let BAt) be the number of Helios broadcast by i during this period. At 

c (t) 

the end of t slots, each receive node j calculates £ 3 ^ J which is the fraction of Helios of i decoded by j. 

Let pij be the probability that i's Hello is successfully received at j, under the broadcast setting described 
above. Thus pij is the probability that in the slot in which i is transmitting, j is in receive mode, and the 
SINR of i at j is greater than (3, i.e., 

/ Hi jPt/dh \ 

Pij = (1 " «)Pr -5 I ' J p m r >P\ (2) 



° 2 + Eke 



Since the system activity is independent from slot to slot, by the strong law of large numbers, we have, 
with probability one 

= Ptj (3) 

t^oo Bi(t) 

The pi j estimate thus obtained above is used by node j as the receive link weight of the link We 
now analyse the monotonicity of p^ j with dij for its applicability as distance-like information. 

1) Monotonicity of pij with dij Under Isotropic Conditions: From Equation |2j we have 



Pij = (1 - a)Pr < f- > 



v 2 + Ek & N (HkjPt/dlAh - H id P t /d. 
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where Ij := ^2 keN [H^jPf/ 'd\ A Ik is the random variable that represents the total power received at j. 

For a large spatially homogeneous network, where the node density is constant over the entire region, the 
total power received at each node Ij can be assumed to be identically distributed at all node locations, with 
common CDF This assumption will be shown to be valid (to a good approximation) for the interior 
nodes using simulations in IIII-Ai Then, using A(.), the cumulative distribution function of iJjj (which is 
assumed to be identical across all the equation above becomes 



n (HW 2 \ 



dA(h) (4) 



Since F(.) and A(.) are monotone increasing functions (as both are CDFs), it can be seen from Equation |4] 
above that pij is monotone decreasing with dij. This allows us to replace dij in the previous algorithm (in 
Section Hl-Bb by —pij, to obtain a distance-free algorithm which is described next in Section Hl-Dl 

D. DISCRIT: Distributed CRITical geometric graph algorithm 

At the end of the Hello-protocol based neighbour discovery, each node i has link weights p^j for each of 
its neighbours j. Every node i maintains a p-threshold p(i) and an adjacent node list N(i). At any iteration, 
N(i) is the set of nodes j whose pj^ values are greater than or equal to p-threshold p(i). 

1) Initialisation: For every node i, the p-threshold p(°\i) is initialised to the maximum link weight, and 
the adjacent node list N^ k \i) contains only the node(s) with the maximum weight. That is, for all 

iGJV, 

p(°\i) = max{»ji} and N^(i) = argmax{p, j} 
j j 

Set iteration index k = 0. 

2) p-threshold unicast: Every node i informs its current p-threshold p^ k \i) to all its current adjacent 
nodes, i.e., nodes in N^ k \i). Thus, the node i also receives the p-thresholds p^ k \j) from some of its 
neighbours given by the set S^>(i) = {j : i E N^(j)}. 

3) Updating the Adjacent Node List: The node then updates its p-threshold p^ k+1 \i) to the minimum 
of the p-thresholds it received. The minimum finding includes the node's current p-threshold p^ k \i) 
also. The adjacent node list N^ k+l \i) is also updated accordingly as the set of nodes whose pj^s are 
greater than the updated p-threshold p^ k+1 \i). Let t^ k \i) = mm{p( k '(j) : j G S^ k '(i)}, which is the 
smallest of the p-thresholds received by i. Then 

p^ k+1 \i) = min{pW(i),t (fc) (*)} and N^ k+l \i) = {j : Pjji > p( fe+1 )(i)} 

4) Terminating Condition: The algorithm terminates if all the p-thresholds in an iteration remain 
unchanged, i.e., 

IF p( fc+1 )(i) = p(*)(») for all i G N, Call N(i) = N^ k \i), STOP 



ELSE Set k = k + 1, go to Step 2. 

The distributed terminating condition is as described for the algorithm based on distances, and can be 
used by the nodes to terminate locally. 
5) Making links bidirectional: Let S(i) = {j : i G N(j)}. Thus S(i) represents the nodes having i as its 
adjacent node. The bidirectionality is achieved by updating the adjacent node list as N(i) = N(i)L)S(i). 
The graph resulting from the algorithm Q\ is then given by Q\ = (V,Ei) where E\ = : i G 

N,jeN(i)} " ' ' m 

Remarks 2.4: Unlike distances dij, pij ^ pj^ in general. Therefore, the graph obtained after the terminating 
condition in Step 4 need not be bidirectional. Hence, additional edges are added to the graph in Step 5 to 
ensure bidirectionality. 

Remarks 2.5: Note that for DISCRIT to be valid, we need spatial homogeneity for pij monotonicity as 
well as the Penrose's result (Corollary [T]) to hold. While randomised lattice and grid deployments are 
spatially homogeneous, a uniform i.i.d. deployment, in general, can create sparse and dense node placements. 
However, as n is increased, uniform i. i. d. deployment is homogeneous w.h.p. in a sense described below. 
Further, Corollary Q] is known to be true for uniform i. i. d. deployment. We will use simulations to study 
the applicability to other deployments. 

Given a uniform i. i. d. deployment V, any r > 0, and any point x within the region A, define closed disc 
of radius r around x, D r (x) := {y G R 2 :|| y — x ||< r}. Let N r (x;~V) be the number of nodes in D r (x), 
i.e., within a radius of r around x. Define the interior of A as A(r) = {x G A : D r {x) C .4}. The following 
theorem gives the joint convergence of N r (x; V) for all x G A(r). 

Theorem 3: For any e > however small, 

lim V n { V : ^-(1 - e) < N ^ x '^ < _^L(i + e ) for every x G A(r' 
n^oo I \A\ Trr 2 - I A I 

Proof: See Appendix iBl ■ 

The result above implies that, for a uniform i. i. d. placement of nodes, the node density around every interior 
point, i.e., Nr< f r '^ , is arbitrarily close to the network density ™ w.h.p., thus implying the homogeneity of 
dense uniform i. i. d. deployments. 

Remarks 2.6: For finite regions, even when the deployment is spatially homogeneous, there will be an edge 
effect. The received power at a periphery node is usually less compared to the receive power at the nodes 
in the interior, because of the difference in node density at the periphery and the centre of the region (non- 
homogeneity at the edges) 0. This edge effect will distort the behaviour of DISCRIT, as will be seen in the 
simulation results in Section |III] A remedy for edge effect is to extend the node deployment beyond the 
boundaries of the area of interest, so that the actual region of interest does not experience the edge effect. 

III. DISCRIT:Simulation Results 

A. Power Distribution 

In Section III-DI we assumed that the total power at a receiver node (Signal + Interference + Noise) is 
identically distributed at every receiver. When the number of nodes is sufficiently large, a receiver far away 



4 Note that even the homogeneity result in Theorem [3] is applicable at the interior of the region, and not at the edges. 



from the edges sees almost the same concentration of nodes around it and hence experiences the same 
distribution of total received power. This is particularly so if the exponent for power loss with distance 77 
is large, because only the nearby nodes can make a measurable difference to the power received. This also 
reduces the "edge effect" seen by the receivers close to the edges. 

For a verification of this in simulation, two random deployments of nodes in a unit square were considered, 
one with 1000 nodes and the other with 5000 nodes, rj is also varied from 2.0 to 4.0. In each case, a large 
number of time slots of the slotted- Aloha protocol were simulated in Matlab, where each node transmits 
with some probability a independent of all other nodes, and acts as a receiver otherwise. The nodes in the 
unit square are also divided into 5 concentric annular regions of equal width (of 0.1 units). The total power 
received at the nodes in each region was calculated and an empirical probability of occurence of each value 
of power (upto a certain resolution) was calculated. The resulting empirical distribution of powers is shown 
in Figure Q] 

We notice from Figure Q] that, for a given number of nodes (1000 or 5000), the assumption of the same 
received power distribution across nodes becomes better as the path loss exponent, 77, increases, and as the 
nodes are taken farther away from the edge of the region. The assumption also works better for a larger 
node density. In fact, we see that for 5000 nodes, distributed independently and uniformly over the region, 
and 77 = 4, the approximation is excellent for nodes lying at points greater than 0. 1 units from the boundary. 

Distribute rb III nsr'arsd Divided into Distributors s : Tuts Pmai a: Rette ic h- nudes Divided into Distributions olTclal Power at deceiver tor rades Divided into 
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Fig. 1. Empirical histograms of total received power at nodes in 5 concentric annular regions in a 1 Km x 1 Km region, with 
the nodes distributed independently and uniformly over the region. The top row corresponds to 1000 nodes, and the bottom row to 
5000 nodes. The columns in this 2x3 array of plots correspond successively to path loss exponents r\ — 2, 3, 4. 
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Fig. 2. Uniform i.i.d. deployment: Comparison of the critical geometric graph and the approximation provided by DISCRIT. The 
leftmost column of plots shows the node locations, the middle column the critical geometric graph, and the rightmost column the 
result obtained from DISCRIT. In the bottom row of plots the algorithm is run only on the "interior" nodes shown in the leftmost 
plot. 

B. Performance of DISCRIT 

Simulations were carried out in Matlab using both random and uniform (randomised lattice) deployments of 
1000 nodes in a unit square region. In both cases, the DISCRIT algorithm was performed on the deployment 
under isotropic conditions and the results were compared with the respective Critical Geometric Graphs for 
the deployments. 

Figure [2] provides results for uniform i.i.d. deployment. From a visual comparison of the actual CGG and 
the graph provided by DISCRIT we conclude that DISCRIT provide a graph with a similar visual structure, 
though with fewer links (we will evaluate this quantitatively below). In the light of the discussion about 
"edge effects" in Remarks 12.61 the nodes that were less than 0.1 units away from any edge of the unit square 
were removed and all the preceding analysis was performed on the interior node deployments, using the 
same link weights from Hello protocol for the entire deployment. The resulting DISCRIT output graph and 
CGG can be seen to be visually much closer to the actual CGG on the interior nodes. Similarly, Figure [3] 
provides the results from DISCRIT for the randomised lattice deployment. 

1) A quantitative measure of similarity between the CGG and output of DISCRIT: We introduce a two-part 
measure to estimate the similarity between the approximate GG given by DISCRIT, Q\ and the true critical 
geometric graph, Q C Ht or the FNNGG Q\. A geometric graph (GG) of radius r is defined by two criteria : 



Fig. 3. Randomised lattice deployment: Comparison of the critical geometric graph and the approximation provided by DISCRIT. 
The leftmost column of plots shows the node locations, the middle column the critical geometric graph, and the rightmost column 
the result obtained from DISCRIT. In the bottom row of plots the algorithm is run only on the "interior" nodes shown in the leftmost 
plot. 



1) No edges of length more than r should be present. 

2) All edges of length at most r should be present. 



Accordingly, we have defined the function D(.) on a pair of graphs G\ and G2 as D{G\,G2) = TeIT ' 
where is the edge set of graph Gi, for i = 1, 2. D(Q\,Q cr i t ) is the fraction of edges of Q\ that are longer 
than r cr it and D(Q cr i t ,Gi) is the fraction of edges shorter than r cr u that are missing in Q\. Ideally, both 
these quantities should be 0. 

Simulation results shown in Table U show that the DISCRIT output is a very good approximation to the 
corresponding critical geometric graph and FNNGG, particularly when only the interior nodes are considered. 
Note that the exact degree- 1 GG is in fact identical to the critical GG for the i.i.d. and grid deployments as 
well as for the interior nodes in the randomised lattice deployment. This shows that Penrose's Corollary \T\ 
though applicable only asymptotically, practically holds even for finite node densities. 
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TABLE I 

Measures of disparity between the DISCRIT graph, the exact degree- 1 geometric graph, and the critical 

GEOMETRIC GRAPH, FOR THE UNIFORM I.I.D. DEPLOYMENT, THE RANDOMISED LATTICE DEPLOYMENT, AND THE 

DETERMINISTIC LATTICE GRID. 



IV. CGG based Distance Discretisation 



In this section we show how Q cr it and DISCRIT can be useful in obtaining various optimal topologies in 
a distributed fashion. Given a graph, the hop-distance H between two nodes on the graph is defined as the 
minimum number of hops on the graph between the nodes. We propose to use the hop-distance on Qcrit 
between two nodes as a measure of (Euclidean) distance between them. This technique is distributed since, 
we already have DISCRIT which is a distributed construction of Q cr it, and hop-distance calculation can be 
done using the distributed Bellman-Ford algorithm. 

The concept of using hop-distance as a distance measure already exists in the literature; the important 
applications being DV-hop routing by Niculescu and Nath |2], and localisation by Nagpal et al. in Q. In 
all these methods, the hop-distance is calculated on a specific geometric graph Q(V,Rq) where Ro is the 
communication range of each node. Note that for a flat terrain and omnidirectional transmission, the edges 
in Q(V, Ro) join all possible direct communication neighbours. However we intend to use hop-distance on 
Qcrit- Our technique is advantageous over the existing methods like DV-hop in the following ways: 

1) Gcrit is an intrinsic structure of the node layout, unlike Q(V,Rq) which is dependent on the com- 
munication parameters. Hence the proposed distance discretisation technique is independent of the 
communication setup. 

2) It is shown in that the error in distance estimation is proportional to the radius of the GG used 
for hop-distance calculation. (To see this, consider a line and if one has to express distance in integral 
multiples of r, then a mean error of 0.5r is obtained.) Thus, since r CT it is the smallest GG radius 
which ensures connectivity, using Q cr it provides better distance resolution than Q(V, Ro)- Also, unlike 
a fixed Ro, r cr it decreases with increase in n (see scaling of r cr it in ifTTl ): thus the distance estimation 
using Qcrit keeps improving with n (also see numerical evaluation below). 

Given node locations V, let hij represent the hop-distance between i and j on Q cr it- Define pij = 
Note that the proposed distance discretisation is valid if the hop-distance hij is proportional to the Euclidean 
distance dij, i.e., if pij is a constant for all (i,j) pairs. Simulation results show that as n increases, dij 
becomes roughly proportional to hij. 
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A. Numerical Evaluation 

We consider different values of n varying from 100 nodes to 5000 nodes. For each n, a sample uniform 
i.i.d. deployment V is considered. For each V, the CGG Gcrit is found, and pij is evaluated for every node 
pair Figure [4] shows the empirical distribution (normalised histogram) of pij for different n. As n is 

increased, the support of distribution moves to the smaller values and the distribution becomes narrower. In 
other words, the variation of p^j over node pairs decreases as the node density is increased. 

Figure [5] and Figure [6] show the plots of empirical variance cr^(V) and the empirical coefficient of variation^ 
CV P (V) of pij against n (along with the confidence intervals). The values of a p (V) and CV P (V) are found 
to decrease as n is increased. Thus the plots indicate that the pij becomes roughly a constant for large n, 
thus justifying the proposed distance discretisation technique. 

V. An Example Application: Optimal Self-Organisation 00 

Here we illustrate the usefulness of Q cr it based distance discretisation (and hence of DISCRIT) to self- 
organisation problems involving distance information. We consider a self-organisation problem formulated 
in (8| of obtaining the optimal hop-length which maximises the transport capacity on a single-cell dense ad 



5 hop-distance is a "distance" as it satisfies the properties of non-negativity, symmetry, and triangle inequality 
6 The coefficient of variation is the ratio of standard deviation to the mean. 
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Fig. 8. Comparison of theoretical Transport capacity with 
those obtained form simulation. The hop-distance h is replaced 
by mean hop-length in Th. 



hoc network. There is a dense deployment of nodes in a limited area. Source-destination pairs are chosen 
randomly and the traffic is assumed to be homogeneous. A multihop ad hoc wireless sensor network needs 
to be self-organised in such a way that all communication hops are of equal length d. The multiple access 
protocol is such that only one successful transmission can occur at any time in the network (i.e., there is 
no spatial reuse). Although [8| considers a fading channel model, we will restrict to the no-fading case. 
There is an node transmit power constraint, P t , and the nodes have the capability of achieving the Shannon 
capacity over the hop-length d, that is, the bit rate of Wlog (l + ^pr) where W is the bandwidth and «o is 
a constant accounting for power gains between the transmitter and receiver. Under this setup, the aggregate 
bit rate carried by the system when all nodes transmit over a distance d per hop, takes the form (see (H) 
\(d) = a log (l + ^pr) where a is a constant which depends on the contention parameters. The objective 
is to maximise the network transport capacity (in bit-meters/sec) given by 



V(d) = d\(d) = a d log 



+ d^a 2 J 



(5) 



over all hop-lengths d. It can be seen that there exists an optimal hop-distance d op t which maximises ^(d). 
The trade-off comes from the fact that if the network self-organises into short hop lengths, then the bit rate 
achieved over a hop is large, but each packet has to traverse many hops. On the other hand if d is large 
then the bit rate over a hop will be small, but fewer hops need to be traversed. 



A. A Self-Organisation Heuristic 



We aim to obtain a topology whose hop-lengths are close to d op t- We employ the distance discretisation 
technique described in Section [TV] to convert the problem of finding d opt to one of finding an optimal hop- 
distance (on Qcrit), h opt , in order to maximise transport capacity. The resulting self-organisation algorithm 
is described below. 



1) Obtain the critical geometric graph Q cr it' Given the node locations, obtain Q cr it- This is required 
to obtain the hop-distance information. 

2) Obtain /i-hop-distance topology Th'. For h > 1, the /i-hop-distance topology Th is obtained by having 
an edge between all node-pairs with hop-distance hi j = h on Q cr it- The edges (hops) in this 
topology are considered to have h units of distance. T%, thus, denotes the CGG. 

3) Find the optimal hop-distance h opt : Using each topology Th, find the corresponding network transport 
capacity VP/j. The optimal hop-distance topology T opt (equivalently the optimal hop-distance h op t) is 
chosen to be the one which maximises 



B. Numerical Results 



The uniform i.i.d deployment shown in Figure [2] is considered. We use both the centralised Gcrit and the 
distributed Q\ for hop-distance calculation. The hop-distances are calculated using distributed distance-vector 
routing, using which /i-hop topologies {Th} are found. 

Nodes are assumed to be saturated and attempt for the channel with a fixed probability. For each topology 
Th, the following is done. Whenever successful, a node transmits randomly to one of its adjacent nodes in 
Th, i.e., to a neighbour h hops away on Q cr it- Each node counts the number of bit-meters transmitted by 
it. At the end of certain time t, the network transport capacity ^ is calculated as the average bit-meters 
transmitted by all the nodes in unit time. 

The plot of transport capacity ^ against the hop-distance h is shown in Figure |7J The optimal hop-distance 
hopt = 3 when either of Q cr it or Q\ is used. We may conclude that using T3 could be optimum for the 
transport capacity objective. For comparing the result from the heuristic with the theory, we plot the transport 
capacity ^ against the mean hop-length in Th, and then compare it with the theoretical plot obtained using 
Equation [5] The plots are shown together in Figure [8] The plots being close to each other verify the validity 
of the proposed heuristic, and the applicability of the proposed distance discretisation technique. 

Thus, for the example scenario above, nodes will use T3 as the communication topology in order to obtain 
a good transport capacity. That is, each node will communicate directly with nodes 3 hops away in Q CT it 
(or in DISCRIT output Qi) as h opt = 3. 



VI. Application to Node Localisation 



We illustrate another use of Q cr u and DISCRIT in estimating the locations of the nodes in a sensor network. 
The method described here was conceived independently by us [12], but has been also reported by Yang et 
al.(9], who call it Hop Count Ratio-based Localisation (HCRL). But the difference is that, like other existing 
methods which use hop-distances, HCRL also uses Q(V,Rq) (described in Section HVT) for hop-distance 
calculation, while we use Gcrit- 



A. Theory 

Given the node deployment V, consider ng beacons (nodes whose positions are known) given by the set 
{.Bl, B2, . . . , B nB }. Let S be a node whose location is to be determined. From the distance discretisation 
technique described in Section |IVJ we take hs,B t oc ds.Bj approximately for large n. Thus, for any 1 < 
i,j < i^b, we have ^ ^ 

~ TT^ = ri >i (say) (6) 

ds,B, hs,B j 

Let (x,y), (x{,yi), and (xj,yj) be the co-ordinates of S, B{, and Bj respectively. Then, taking the approx- 
imation in © to be an equality, we have 

y/(x- Xi) 2 + (y- Vi) 2 

= ri j solving which we get 

y/(x - xj) 2 + (y - yj) 2 

(1 - rlj) x 2 + (1 - r y y 1 -2( Xi - r^x 2 ) x - 2 ( w - r^ % ) y + (x 2 + y 2 ) - r 2 hJ (x 2 + y 2 ) = (7) 

When Vij = 1, the equation above is linear and represents the perpendicular bisector of line joining Bi and 
Bj, whereas when r^j 7^ 1, it represents a circle. In general, such a circle is called an Apollonius circle 
ll9l l[T3l . As we need a minimum of 3 such circles to get a point estimate of the node location, we need at 
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least 4 beacons. Note that 3 Apollonius circles can be obtained even with 3 beacons, but all 3 intersect at 
2 points in general, and hence an additional beacon is required. 

B. Numerical Results 

We consider the uniform i.i.d. deployment of 1000 nodes in a 1 Km x 1 Km square shown in Figure |2] 

4 nodes nearest to 4 corners of the square are taken as beacons (shown as o in Figure [9]). A sample node 

5 is considered for the purpose of illustration (shown as x in Figure [9]). The hop-distance calculation is 
made on the corresponding Q cr it- 3 of the 6 possible Apollonius circles are shown in Figure [9] The circles 
correspond to beacon-pairs (1,2), (1,3) and (1,4). (Note that if hi j were exactly proportional to d^j, then 
all the Apollonius circles would intersect at a point, which would coincide with the actual location x.) To 
obtain an estimate, we use an inbuilt MATLAB function which solves an optimisation problem involving 
the circle equations. The estimate is shown as . in Figure [9] The error in the position estimation, i.e., the 
distance between the node's actual position and the estimate is 41 m (comparable with r cr j t =56.7 m). 

To observe the localisation performance for all nodes, an error pattern is obtained by joining each node's 
actual position and its estimate. The error pattern is shown in Figure [TOj The mean estimation error is 55 m. 
The same experiment is repeated by performing hop-distance calculation on the DISCRIT output Q\. The 
resulting error pattern is shown in Figure [TT] The mean error is 74 m in this case, which is higher compared 
to that of centralised Q C rit- In both the cases, the error is large towards the edges because of smaller node 
density at the edges and since the distance discretisation technique is valid for large node densities (see 



Section [TV). 



VII. Conclusion 

The paper provides DISCRIT: a distributed algorithm to approximate Q cr u, using link weights obtained from 
the Hello-protocol-based neighbour discovery. The validity of DISCRIT is shown (with high probability) 
for a dense uniform i. i. d. deployment, by making use of Penrose's result and the spatial homogeneity 
of the deployment. Simulation results are shown for other types of deployments too, which indicate that 
the algorithm provides good approximations of Q cr it- Using Q CT it (and DISCRIT), a distributed technique 
of associating distances to links joining node pairs is also proposed. Example applications of Q C nt based 
distance discretisation are shown for a self-organisation problem of obtaining the optimal hop-length which 
maximises transport capacity for a dense ad hoc network operated as a single-cell, and also for node 
localisation. 

In related work, we have also considered the problem of anisotropic antenna radiation patterns, which renders 
invalid the direct use of Hello counts as described in this paper. However, assuming that antenna patterns 
are randomly and uniformly oriented, nodes can locally cooperate to address the anisotropy problem. 

Other extensions of this work could include handling spatially non-homogeneous deployments, which in 
turn will also help reduce the edge effect seen in DISCRIT outputs. Other practical issues to be examined 
are self-tuning of parameters such as the Hello transmission latency and node transmit power for improved 
performance of DISCRIT. 

The algorithms and techniques proposed in the paper, apart from being distributed and asynchronous, require 
only limited capabilities from the nodes. The nodes are not equipped with position finding devices such as 
GPS. Receive Signal Strength (RSS) based techniques for distance estimation, which require accurate power 
measurement and scheduling, are also not used. Thus, our approach can provide a simple yet effective way 
of self-organisation in sensor networks. 
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Appendix 



Appendix A 
Proof of Theorem [2] 

Theorem: If Q\ is connected, then the algorithm converges to Q\, i.e., Qa = Gi, in at most D iterations, 
where D is the hop diameter of Q\. 

Proof: The proof proceeds in a few lemmas. 

Lemma A.l: For all k, for all i G N, A k \i) = for some j G N (j depends on i and k). 

Proof: Induction on k: From Step 3 of the algorithm, we have A k+l \i) = max{r^^(j) : j G S^^i)}. 
Therefore, 

r (k) {i) = r (fc-i) (il ) = . . . = rWfo.!) = r (o) (j -) 

for some ii , Z2 , . . . , j G N. ■ 

Lemma A.2: For all i £ N, for all fc, A k \i) < r\. 

Proof: From Lemma IA~T1 A k \i) = r^\j) for some j G iV. Therefore 

r^ k \i) = r (0) (j) = nun {dj- jfc} < max{ min {d jtk }} = n 

keN,k^j jeAf keN,k^j 

m 

Lemma A.3: If A k \i) = r% for some A;, then for s = 1, 2, 3, . . ., A k+S \i) = r\ 
Proof: Note that i G N^ k \i) since = 0, and hence i G S^ k \i). Thus 

r( fc+1 )(i) = max{r^(i) : j G S (fc) (*)} > A k \i) = n 
But from Lemma IA.2I r( fc+1 )(i) < r±, therefore A k+1 \i) = r\. 

The argument can be extended to show A k+S \i) = r± for s = 2, 3, ■ 

Let Z = argmaxj g Ar{r(°)(z)}. Then, r(°'(i) = ri. Let C(h) = {i : i is connected to Z in h hops in 

Lemma A.4: For all i G C(/i), A h \i) = r\. 

Proof: Induction on h: i G C(l) means that d;j < n = r(°)(Z). Therefore, i G iV°(Z) Z G S^(i), 
and hence, r^(i) > = r%. But from Lemma IA.2I r^'{i) < ri. Therefore, r^(i) = ri. 

Assume, for all i G C(h), A h \i) = r%. Now, for any i G C(/i + 1), there exists a j £ C(/i) such that 
did ^ r i- Thus r(h) (i) = n i G N( h \j), and hence j G S^i). Therefore r^+^i) = m&x{A h \t) : 
t G SW(»)} > r( k )(j) = n. Again from Lemma Q r^ +1 )(i) = n for all i G C(h + 1). 

Lemma IA.4I now follows from the induction principle. ■ 

Lemma A.5: Suppose Gi is connected. Let m be the maximum number of hops in the least-hop path on Q\ 
between any node % and Z (m is the hop radius of C?i centered at Z). Then the algorithm converges in m 
iterations. Moreover r(i) = r\ for all i. 

Proof: Take any node i G N. Since Q\ is connected, j G C(h) for some h < m. Now from Lemma [A4l 
A h \i) = r\ and therefore from Lemma IA.3I A m \i) = r\ as m > h. 



Now at m th iteration, we have A m \i) = n for all i G N. Therefore from Lemma lA!2l A m+1 \i) = n for 
all i. 

Thus at Step 4, the algorithm terminates and r(i) = r± for all i G iV. ■ 

Thus, from Lemma IA.5I r(i) = n, and ./V(i) = {j : djj < ri}. Now from Step 5 of the algorithm, it can 
be seen that Ga = Gi- Theorem [2] follows by observing the fact that m < D, the hop diameter of Gi- ■ 



Appendix B 
Proof of Theorem [3] 

Theorem: For any e > however small, 

lim V n { V : -^—(1 - e) < Nr ^ x '^ < + e ) for every x G Mr) 

n^oo { \A\ trr 2 ~ | A | 

Proof: Note that the expression contained within V n {.} involves an intersection of (uncountably) infinite 
events, where each event corresponds to a bound on N r (x; V) for each point x G A(r). For this purpose, we 
make use of uniform convergence of weak law of large numbers, given by Vapnik-Chervonenkis theorem. 

Given a set U, let C be a collection of subsets of U. Let F be a finite subset of U. F is said to be shattered 
by C if for every subset G of F, there exists a subset C G C such that F n C = G. The VC-dimension of 
C is defined as the supremum of sizes of all finite sets (Fs) that can be shattered by C. 

Theorem 4 (Vapnik and Chervonenkis): If C is a set of finite VC-dimension dc, and {Xi} 1 ^ 1 be a sequence 
of i.i.d random variables taking values in U with common probability distribution P, then for every e',5 > 



Pr ( sup 



111 



P(C) 



< e' > 1 - S 



whenever 

d c . 16e 4 2 
m > max{— log — — , — log - j 

e e e o 

■ 

Here Is \ represents the indicator function. For our purpose, let U represent R 2 . Then F represents a finite 
set of points in R 2 . Define a closed disc of radius r > and centre x as D r (x) := {y G R 2 :|| x — y \\< r}. 
Let C represent a collection of all closed discs in R 2 , i.e., C := {D r {x) : x G R 2 , r > 0}. Gupta and Kumar 
in (51 have shown the following. 

Lemma 6: The VC-dimension of set of all closed discs in R 2 is 3, i.e., dc =3. ■ 

By definition D r (x) G C for every x G A{r). Now if we take the common probability distribution P in 
Theorem 0] to be the uniform measure V on A, then it can be seen that i.i.d. random variable sequence {Xi} 
is nothing but a uniform i.i.d. deployment V on A. Now observing that V{D r (x)} = ™- for all x G A(r), 
Theorem [4] and Lemma [6] give rise to the following result. 



Corollary 2: For every e',5 > 



V 



n I A 



< e' for every x G A(r) > > 1 — 5 



i . 3 16e 4 2 
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e e e o 
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Thus Corollary [2] becomes 
For every e',6 > 



\ |^4| V vrr 2 / ~ 7rr 2 



< 



i + 



for every a; G A(r) > > 1 — 5 



r 3 , 16e 4 , 2, 
whenever n > max] — log , — log - } 

e e e o 

Theorem [3] now follows by taking e := ^£-, and observing that N r (x; V) = ^{KeD r (x)}> the number 
of nodes in D r (x). ■ 



